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ABSTRACT 


A  general  purpose  7030  computer  program  has  been  written  in  the 
FORTRAN  environment  to  merge,  in  a  single  run  of  the  program,  two, 
three,  or  four  tape  files  which  are  the  resulting  output  from  a  sort  program. 
These  files  may  be  more  than  one  tape  in  length  and  may  be  either  FORTRAN 
or  non- FOR TRAN  formatted.  Input  and  output  processors  have  been  de¬ 
signed  to  handle  various  types  of  item  formats;  the  program  aeeepts  items 
of  fixed  length,  variable  length  (identified  by  a  predesigned  terminator)  and 
variable  length  with  a  length- defining  field.  Upon  request,  the  output 
processor  will  eliminate  duplicate  items  and/or  fill  in  variable  length 
reeords  to  a  standard  size. 
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SECTION  I 


INTRODUCTION 

A  general  purpose  7030  computer  program,  MERGE,  has  been  written  in 
the  FORTRAN  environment  to  merge  in  a  single  run  of  the  program  two, 
three,  or  four  tape  files  which  are  the  resulting  output  from  a  sort 
program.  These  files  may  be  more  than  one  tape  in  length  and  may  be 
either  FORTRAN  or  non-FORTRAN  formatted.  Input  and  output  processors 
have  been  designed  to  handle  various  types  of  item  formats;  the  pro¬ 
gram  accepts  items  of  fixed  length,  variable  length  (identified  by  a 
predesignated  terminator)  and  variable  length  with  a  length-defining 
field.  Upon  request,  the  output  processor  will  eliminate  duplicate 
items  and/or  fill  in  variable  length  records  to  a  standard  size. 

The  major  functions  of  the  program  are  illustrated  in  Figure  1.  The 
flow  as  shown  is  only  an  introduction  to  the  general  functions  of  the 
program;  the  actual  flow  varies  according  to  the  intermixing  of 


INITL  INITL  and  Machine  MERGE,  INEDIT  and 

language  subprogram  OTEDIT 


Expansion  of  the  merging  process  is  illustrated  below. 
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mainstream  and  autostacked  modes.  The  relative  positions  of  the  pro¬ 
grams  is  of  major  importance  since  some  of  the  programs  are  eventually 
destroyed;  therefore,  a  knowledge  of  core  layout  is  essential.  A 
description  of  the  subroutines  -  and  the  order  in  which  they  should 
be  arranged  -  is  provided  in  the  section  entitled  SUBROUTINES. 
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SECTION  II 


PROGRAM  FUNCTIONS 

The  first  function  of  the  program  MERGE  is  to  find  data  for  each  of  the 
(four)  files.  One  input  is  preselected  from  each  channel  before  the  program 
proceeds.  This  process  requires  that  tables  be  structured  to  keep  up  with 
the  correct  address  and  to  modify  the  appropriate  address  of  the  last  file 
for  each  channel  (LSTA-D).  This  section  describes  in  detail  the  basic 
table  structures  in  the  program  and  the  conditions  under  which  they  are 
modified . 

Input  Channel  Tape  (ICT)  Tables 

The  table  word  format  is  described  in  the  paragraph  on  common  storage 
tables;  the  indicators  included  in  this  table  and  the  functions  they  per¬ 
form  are  described  below. 

The  channel  BUSY  bit  is  turned  on  only  if  there  is  a  read  or  write  being 
performed  on  the  channel.  The  contents  of  the  control  word  reflect 
the  I/O  operation  currently  on  the  channel;  bit  27  of  the  control  word 
is  1  if  this  is  a  write  operation;  a  0  if  it  is  a  read  operation. 

The  address  for  the  last  file  of  each  item  is  located  in  LSIA-D  (words 
4,  5,  6,  and  7)  which  point  to  “HIKEY"  -  which  contains  a  floating 
point  mantissa  of  ones  -  whenever: 
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input  does  not  exist,  e  g  ,  input  D  during  a  two  or  three  way 
merge  is  blank 

the  specific  input  file  has  been  depleted 

the  tape  containing  this  input  file  is  connected  to  another 
I/O  channel 

the  channel  is  already  reading  that  file. 

Since  HLKEY  is  greater  than  any  other  key,  the  input  is  not  selected 
The  INI.TL  subroutine  clears  the  tables  ;  computes  the  number  of  input 
files  not  on  this  channel,  called  OTHER,  in  bits  32-49  of  word  0; 
finds  the  channel  I/O  queue  address  (QP)  ;  and  points  to  HIKEY. 

Conditions  Under  Which  IGT  Tables  Are  Modified 

The  BUSY  bit  is  set  to  1  when  a  read  or  write  operation  is  begun  and 
to  0  before  exiting  from  the  read  or  write  EOP.,  unless  the  next  read 
or  write  is  begun  there e  OTHER  is  adjusted  whenever  a  tape  swap  in¬ 
volves  a  channel  change. 

The  control  word  (word  1)  must  be  filled  with  the  word  fetched  from 
the  top  of  the  channel  I/O  queue  before  a  read  or  write  is  begun,  this 
word  becomes  the  read  or  write  control  word. 

The  OP  word  should  always  point  to  the  top  of  the  channel's  I/O  queue; 
its  count  field  (word  3,  bits  28  -  45)  must  contain  the  number  of  entries 
currently  on  that  queue 
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After  an  input  is  selected  for  reading,  its  LST  must  be  pointed  to  HIKEY. 
At  read  EOP  time,  the  LST  must  point  to  the  core  address  of  the  last 
item  in  the  buffer  just  filled.  This  action  permits  a  procedure 
called  “preselection1'  to  pick  the  next  read  to  queue  by  deciding  which 
of  the  inputs  on  that  channel  will  be  depleted  first. 

Input/Qutput  Queue  Tables 

There  are  four  queues,  one  for  each  data  channel  assignment  now  possible 
and  one  for  future  expansion.  The  I/O  queues  are  in  a  one-to-one 
correspondence  with  the  1/0  channel  tables.  The  I/O  queue  table  is 
accessed  via  the  I/O  queue  pointer,  QP ,  of  the  corresponding  I/O  channel 
table  The  count  field  of  QP  contains  the  number  of  entries  currently 
in  the  queue  (word  3,  bits  24  -  43  of  ICT  tables). 

The  subroutine  INITL  computes  the  necessary  size  of  the  I/O  queue  tables, 
which  is  the  total  number  of  input  buffers  allocated  for  the  merge 
plus  the  total  number  of  output  buffers  allocated  for  the  merge  plus  1. 
The  format  varies  depending  upon  the  content;  if  the  queue  is  empty, 
all  words  are  zero,  if  the  queue  is  not  empty  the  entries  have  two 
forms:  read  requests  and  write  requests. 

Read  Requests  XW,  IB,  L,  FUTf 
where  XW  =  index  word 

IB  =  input  buffer 
L  =  length 

FUTf  =  input  file  pointers,  2_<_  f  <  3 
bit  27=0 
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When  the  next  input  is  selected,  either  at  read  EOP  time  or  at  input 
buffer  depletion  time  if  busy  *=  0 ,  a  read  request  in  the  form  shown  above 
should  be  constructed  as  follows  and  put  at  the  end  of  the  queue 

1.  a  buffer  address  (IB)  is  obtained  from  the  Empty  Input  Buffer 
Stack  (EIS)  and  inserted  into  the  value  field. 

2.  the  count  field  is  set  to  the  standard  input  buffer  length  (L) 

3.  the  refill  field  is  set  to  the  address  of  the  selected  input  File 
Unit  Table  (FUTf ;  2_<  f  5)  . 

Write  Requests  XW,  OB,  M,  FUT1,1 
where  XW  =  Index  word 

OB  =  Output  buffer 
FUT1  =  File  Unit  Table  1 
bit  27  =  1 

When  an  output  buffer  is  filled,  a  write  control  word,  includes  a  1 
in  bit  27,  is  constructed  and  put  on  the  end  of  the  output  I/O  queue. 

This  action  constitutes  a  request  to  write  the  buffer  contents.  The 
value  field  (OB)  of  the  control  word  specifies  the  core  origin  of  the 
buffer;  the  count  field  (M)  specifies  the  record  length,,  The  refill 
field  must  point  to  the  output  File  Unit  Table  (FUT1)  entry.  When 
available,  a  new  current  entry  buffer  must  be  put  on  the  top  of  the 
output  buffer  queue. 

Conditions  Under  Which  I/O  Queue  Tables  Are  Modified 
When  the  current  read  or  write  operation  on  the  channel  has  been 
completed  and  its  control  word  disposed  of,  the  top  word  -  if  any  -  in 
the  I/O  queue  is  moved  to  the  control  word  (word  1)  of  the  same  I/O 
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channel  table  (ICT)  and  the  queue  Is  ‘'popped11.  The  control  word  just 
fetched  is  used  to  begin  execution  of  the  next  read  or  write  on  the 
channel . 

Each  new  request  should  be  made  from  the  top  of  the  queue.  The  con¬ 
tents  of  the  Input  Buffer  Queue  -  including  the  terminator  word  - 
should  be  moved  up  one  word  (popped)  and  the  QP  count  decremented. 
Input  Buffer  Queue  Tables 

There  are  four  input  buffer  queue  tables,  one  for  each  logical  input 
file,  Each  queue  contains  a  control  word  for  each  buffer  currently 
available  in  corea  These  buffers  are  accessed  via  QP2,  QP3,  QP4 , 
and  QP5  -  which  point  respectively  to  the  A,  B,  C,  and  D  input  buffer 
queues.  Each  input  queue  pointer  has  the  form 
QPf:  XW,  BQf,  Nf 

where  QPf  =  QP2,  QP3 ,  QP4,  or  QP5 

BQ^  “  pointer  to  the  queue  origin  and  f  =  2,  3,  4,  or  5 
=  the  current  number  of  entries  and  f  is  same  as  above 
Allocation  of  storage  and  the  presetting  of  QP2,  QP3,  QP4,  and  QP5 
is  performed  by  the  subroutine  INITL*  Each  word  in  the  Input  Buffer 
Queue  table  has  either  the  form: 

XW,  IB.,  L,  FUTf 
where  XW  =  index  word 

IB.  =  input  buffer 

l 

L  =  length 

FUTf  =  File  Unit  Tape,  2_<_  f  <  5 

or  the  form:  a  terminator  word  of  zero  following  the  last  entry. 
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The  top  control  word  in  each  input  buffer  queue  points  to  the  top  of 
the  current  input  buffer.  Merging  is  accomplished  by  examining  the 
current  item  of  each  input  buffer  and  selecting  the  one  with  the  least 
key,  moving  it  (with  output  editing)  to  the  "current"  output  buffer, 
stepping  the  output  buffer  index  register  and  the  buffer  index  of  the 
selected  input.  $A,  $B,  $C ,  $D,  and  $0  are  symbolic  names  for  the 
four  input  index  registers  and  the  output  index  register. 

Conditions  Under  Which  The  Input  Buffer  Queue  Table  is  Modified 

When  an  input  buffer  is  deleted,  its  address  is  returned  to  the  Empty 
Input  Buffer  Stack  (EIS)  pointer  and  the  input  buffer  queue  "popped". 
If  this  EIS  word  is  0,  no  buffer  containing  this  input  is  available. 
The  program  design  allows  this  condition  to  occur  infrequently.  If 
the  input  channel  BUSY  bit  =  0,  the  next  read  on  this  channel  should 
be  preselected,  a  read  request  put  on  the  channel  I/O  is  in  progress. 
Preselection  and  setup  of  the  next  read  should  occur  normally  when 
the  operation  is  completed.  The  merge  should  loop  until  the  queue  is 
eventually  f  0  -  which  indicates  that  a  buffer  is  available. 

If  the  word  at  the  top  of  the  queue  f  0,  it  points  to  a  new  current 
buffer  for  this  input.  If  the  BUSY  bit  -  0 ,  a  read  request  is  pre¬ 
selected,  stacked,  and  the  top  I/O  queue  word  is  unstacked  and  exe¬ 
cuted.  The  input  index  register  should  be  reset  to  point  to  the  new 
queue  and  merging  resumed. 
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Whenever  a  read  is  completed,  LST  in  the  appropriate  I/O  channel  table 
should  be  set  to  the  address  of  the  last  item  in  the  record  which  is 
used  in  preselecting  the  next  input  to  be  read  on  this  channel.  The 
completed  read  control  word  (in  the  I/O  channel  table  word)  is  put  on 
the  end  of  the  input  buffer  queue  and  the  queue-pointer  count  field  is 
incremented  The  next  input  to  be  read  from  the  channel  is  preselected, 
a  read  request  constructed  and  added  to  the  end  of  the  read  queue  and, 
if  BUSY  bit  =  0,  the  top  request  "popped"  and  initiated.  When  an 
input  is  exhausted,  a  dummy  control  word,  HI KEY ,  which  points  to  a  key 
of  bits,  is  put  on  the  end  of  its  input  buffer  queue.  This  control 
word  will  eventually  rise  to  the  top  of  the  queue  and  prevent  that 
input  from  being  selected  and  merged.  Any  other  read  request  for 
this  input  is  deleted  from  the  I/O  queue;  the  buffer  address  is 
returned  to  the  Empty  Input  Buffer  Stack  (EIS) .  Preselection  proceeds 
as  described  above. 

When  an  intermediary  input  reel  -  but  not  a  file  -  is  exhausted,  the 
current  reel  should  be  unloaded  and  the  current  and  alternate  IOD’s 
should  be  switched  (the  IOD  table  is  also  described  in  this  section) . 
Then,  if  the  current  and  alternate  IOD 1 s  are  assigned  to  the  same 
channel,  the  read  is  simply  repeated  using  the  same  control  word  (word 
1  of  ICT)  and  the  new  IOD. 

When  the  end  of  an  intermediary  input  reel  involves  a  channel  address 
change,  because  the  new  current  IOD  has  a  different  channel  address  from 
the  previous  IOD,  considerable  adjustment  is  required  to  purge  the  old 
queue,  and  to  adjust  the  new  one. 
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The  following  operations  are  performed  when  in  the  Autostack  mode. 

1.  Decrement  the  old  OTHER;  increment  the  new  OTHER. 

2.  Delete  all  read  requests  from  the  old  input  I/O  queue,  if 
any,  for  this  input  only  -  returning  the  released  buffers 
to  the  EIS. 

3  Delete  all  read  requests  (for  all  inputs)  from  the  new  input 
I/O  queue,  returning  the  released  buffers  to  the  EIS:  retain 
any  write  requests  on  this  queue. 

4.  Examine  input  buffer  queues  of  all  inputs  now  on  the  new 
channel  A  read  request  is  put  on  this  channel  I/O  queue 
for  any  such  input  whose  '‘current11  buffer  is  empty. 

5.  If  there  is  no  empty  ‘'current"  buffer,  set  LST  for  the 
newly  transfered  item  position  in  the  last  of  its  standby 
buffers.  The  preselection  process  is  then  begun. 

6.  Unstack  and  begin  execution  of  the  next  I/O  request  on  the 
old  channel. 

Empty  Input  Buffer  Stack  (EIS  and  NEIS)  Word  And  Table 
The  NEIS  table  contains  the  available  buffers  that  are  to  be  used  for 
input  I/O.  Access  to  this  table  is  via  the  word  EIS  which  has  the 
following  form: 

XW,  NEIS,  M 
where  XW  =  index  word 

NEIS  =  location  of  the  tdble 

M  *  the  current  number  of  empty  input  buffers;  0  <  M  <_  maximum 
number  of  input  buffers 
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The  Subroutine  IN3TL  computes  the  amount  of  available  core  and  then 
assigns  the  starting  addresses  of  each  input  buffer.  These  results 
are  placed  in  the  NETS  table  and  in  the  one  word  EIS. 

Conditions  Under  Which  The  Empty  Input  Buffer  Stack  Table  is  Modified 

An  enrrv  is  withdrawn  and  tlic  stack  "popped1  each  time  a  read  is  queued 
and  an  entry  is  returned  to  the  NEIS  table;  and  the  stack  is  "pushed" 
each  time  any  input  buffer  is  emptied  or  a  queued  read  is  abandoned  - 
eg  when  a  queue  is  merged  or  at  the  end  of  input.  A  read  is  not 
initiated  unless  the  stack  contains  at  least  one  entry.  Unless  a 
channel  I/O  queue  is  empty,  the  program  may  not  queue  another  read 
for  it  unless  the  number  of  available  buffers  is  greater  than  OTHER 
for  this  channel.  This  restriction  is  intended  to  prevent  a  channel 
from  cleaning  the  stack  and  forcing  delays  on  other  channels  due  to 
lack  of  buffers. 

Output  Buffer  Queue  (QP1)  Table 

This  queue  points  to  a  word  which  points  to  the  buffer  currently  being 
filled  by  the  merge  process  QP1  has  the  form: 

QP1 :  XW5  BQ1,  1 
where  XW  =  index  word 

FQ1  3  pointer  to  the  output  buffer  location  currently  being 

filled  BQ1  has  the  form: 

XW,  OB.,  M. 
j  J 

where  OB.  =  the  location  of  the  output  buffer 
J 

M  =  the  number  of  words  presently  in  the  table 

j 
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Conditions  Under  Which  the  Output  Buffer  Queue  Table  is  Modified 


When  the  output  buffer  OB.  is  filled,  a  control  word  for  this  buffer  is 
constructed  and  put  in  the  proper  I/O  queue  table.  A  new  output  buffer 
is  fetched  from  the  NEOS  and  placed  in  the  value  field  of  BQ±:  the 
count  field  of  BQ1  is  set  to  0 

Empty  Output  Buffer  Stack  (EOS  and  NEOS)  Word  and  Table 

The  second  function  of  the  MERGE  program  is  to  fetch  an  output  bufft-r 

from  the  Empty  Output  Buffer  Stack  (EOS),  construct  a  zero  field  and 

put  it  in  the  top  queue  position  to  be  ready  to  accept  the  first  merged 

item 

The  NEOS  table  contains  the  starting  addresses  of  the  available  output 
buffers.  They  are  accessed  by  the  Empty  Output  Stack  (EOS.)  pointer , 
which  has  the  form 
XW,  NEOS,  N 

where  XW  =  index  register 

NEOS  =  starting  address  of  the  table 

N  =  the  current  number  of  empty  output  buffers,  0  Nj£  maximum 
number  of  buffers  allocated  for  the  MERGE  by  1NIIL. 

Conditions  Under  Which  The  Empty  Output  Buffer  Stack  Is  Modified 

Each  time  the  current  output  buffer  is  filled,  an  entry  is  transferred  into 

the  current  output  buffer  and  the  stack  is  'popped1  If  NEOS  is  empty, 

MERGE  loops  until  a  completed  write  EOP  restores  at  least  one  entry  to  NEOS. 

Whenever  a  write  is  completed,  the  released  buffer  address  should  be 

returned  to  the  EOS  and  the  stack  "pushed" . 
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IOD  Table 


This  sixteen-register  associates  an  IOD  with  an  I/O  channel  table, 
which  is  directly  accessed.  Each  entry  has  the  symbolic  base  address 
lODd  where  d  corresponds  to  a  possible  merge  tape  IOD(MCP  reference 
number).  It  may  also  be  accessed  via  File  Unit  Table  entry  (FUTf ;  q.v.). 
The  format  of  IOD  is: 


IODd : 

VF,  d; 

VF,  0  (if  no  merge  tape  has  IOD  assignment  d) 

or 

IOD  : 

VF,  d; 

VF,  ICTK 

where  VF  =  value  field 

d  =  IOD  number  0  <_  d  <_  15 

ICTK  =  an  index  address  corresponding  to  the  7030  tape  channel  to 

which  the  tape  drive  is  connected;  where,  normally,  1  <  K  <4 ; 
ICT  is  the  I/O  channel  table. 

The  subroutine  INITL  looks  up  each  merge  tape  channel  assignment  in  the 
IOD's  HOP  unit  area  table  and  enters  the  base  address  of  the  channel  I/O 
Control  Table  (ICT)  in  the  right  half  of  the  IOD  table  entry.  These 
assignments  do  not  change. 

File  Unit  Table  (FUTf) 

The  File  Unit  Table  associates  each  logical  input  and  output  with  its 
current  and  alternate  tape  drive  and  indirectly  with  the  I/O  channel 
table.  Each  entry  of  the  table  is  directly  addressable.  By  convention, 
f  =  1  corresponds  to  the  output  file;  f  =  2  corresponds  to  the  first 
input  file  (INPUT  A) ,  f  =  3  corresponds  to  the  second  input  file,  etc. 
The  FUT  may  be  accessed  also  by  the  refill  field  of  any  control  word. 
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The  format  is  as  follows: 


FUTf :  VF,  lODd ;  VF,  IODd' 

where  IODd  =  the  IOD  table  entry  for  the  logical  files  current  tape  drive 

IODd*  *  the  entry  for  the  logical  file's  alternate  tape  drive,  if  any. 
If  only  one  tape  drive  is  assigned,  then  IODd  and  IODd*  are  equal. 

Whenever  an  EE  occurs  the  half-words  are  switched:  The  final  EE  points 
to  IOD0  which  always  has  the  form: 

IOD0:  VF,  0;  VF,  0 

indicating  that  no  tape  unit  is  attached  for  that  file. 

BLIM1  and  BLIM5  Tables 

These  tables  perform  the  same  function  for  the  current  buffer  limiting 
address  that  FUT1  -  FUT5  perform  for  the  input/output  files.  This  value 
enables  the  program  to  identify  the  last  item  in  a  buffer;  no  initiali¬ 
zation  is  required  Modification  of  the  table  occurs  each  time  a  new  buffer 


is  added. 


Input  Edit 


The  edit  program  preprocesses  the  tape  record  into  item  blocks  whose 
format  is  compatible  with  MERGE.  The  keys  of  each  item  are  extracted 
and,  if  requested,  are  converted  into  a  modifiable  collated  key.  These 
keys  are  stored  in  the  48-bit  mantissas  of  the  words  preceding  the  item. 
The  last  24  bits  of  the  mantissa  of  the  appended  words  contain  an  item 
number  which  is  always  one  more  than  the  previous  one.  For  example, 
a  record  containing  14  items  of  210  characters  (6-bit  code)  each  would 
be  processed  as  illustrated  in  Figure  2.  Assume  2  keys  in  characters 
5-7,  10-19. 


18  bits 


ITEM  #1 


ITEM  #2 


ITEM  #14 


r 


< 


> 


#  of  words 

0  —  0 

5 

6 

7 

10 

11 

12 

13 

14 

0  —  0 

o  —  0 

15 

16 

17 

18 

19 

0 

0 

0 

o  —  o 

0 

ITEM  # 

n  _  n 

u  —  U 

U 

u  —  U 

ITEM  =  210  characters 


Three 

appended 

words 


BLOCK  =  23  words 


(FILLER) 

0  -  0 


23  words 


23  words 

o - 

-  0 

Figure  2.  Format  of  Appended  Key  Words 
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Since  the  48-bit  mantissa  of  the  second  appended  word  is  not  large 
enough  to  contain  the  key  characters  and  the  24-bit  item  number s  the 
item  number  is  right  justified  in  the  mantissa  of  the  third  appended 
word.  The  pointer  is  moved  23  words  each  time  a  new  item  is  required, 
the  23rd  word  of  each  block  contains  a  20-bit  filler  of  zeros.  The 
preprocessor  is  capable  of  handling  general  input  tapes  The  description 
of  the  tapes  and  the  preprocessing  desired  is  specified  in  the  data 
control  cards,  described  in  the  section  entitled  CONTROL  DECK  ARRANGEMENT 
The  collating  table  is  stored  in  common  storage  and  may  be  modified  by 
a  block  data  deck,  described  in  the  FORTRAN  Manual.  The  standard 
collating  sequence  is: 

blank,  0,  1,  2,  3,  4,  5,  6,  7,  8,  9,  =,  ?,  +,  A,  BP  C,  D  E 

F,  G P  H,  I,  ),  -,  J,  K,  L,  M,  N,  0,  P,  Q?  R,  *,  /,  S, 

T,  U,  V,  W,  X,  Y,  Z,  , ,  (. 

During  initialization  the  program  sets  up  the  code  to  preprocess  the 
input  tapes. 

Output  Edit 

The  program  takes  the  item  selected  by  the  merge  process,  sequence 
checks  it  for  error,  strips  the  appended  key  words  (in  the  example  in 
Figure  2,  the  first  three  words),  and  stores  the  item  in  the  buffer. 

The  program  will  also,  upon  request,  eliminate  duplicates.  Output 
tape  and  editing  are  specified  as  desired  through  the  data  control  decku 
During  initialization  these  parameters  are  used  to  generate  the  code 
to  be  used  for  input  processing. 
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SECTION  III 


SUBROUTINES 

MAIN  -  This  subroutine  is  coded  in  FORTRAN;  its  purpose  is  to  call 
INITL  and  MERGE  routines. 

INEDIT  -  This  subroutine  is  called  when  the  program  is  in  autostack 
mode  at  EOP  time  of  a  read  input  tape  request.  In  this 
phase,  the  preprocessor  extracts  the  sort  keys,  converts 
the  data  according  to  a  collating  sequence  -  if  this  option 
is  requested,  and  rearranges  the  data  into  a  form  compatible 
with  the  MERGE  operation. 

OTEDIT  -  The  output  edit  is  called  by  the  mainstream  program  when 
placing  the  item  in  the  output  buffers.  The  output  edit 
strips  the  sort  keys  from  the  item  and  packs  the  item  into  the 
buffer.  Upon  request,  the  item  will  be  eliminated  if  it  is 
a  duplicate  of  a  previous  one. 

MERGE  -  This  program  performs  all  of  the  merging  and  input/output 

operations;  INEDIT  and  OTEDIT  are  called  by  MERGE  when  needed. 

The  following  words  and  routines  are  all  entries  incorporated  into  one 

so-called  subprogram  which  must  be  placed  after  MERGE;  INITL  calls  these 

entries  as  needed. 

FUTF  -  adds  IOD0  to  every  half-word  in  the  FUT  tables. 

IODF  -  Locates  the  proper  channel  and  stores  ICT  pointers  in  the  right 
half-word  for  each  unit  in  the  IOD  table  that  is  being  used. 

ZEROA  -  Initializes  tables,  clears  QP  tables,  FUT  tables,  and  ICT  tables 
and  stores  zeros  in  each. 
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FIXDT  -  Arranges  common  storage  words  in  the  proper  order  for  general 
parameters . 

INFIX  -  Arranges  common  storage  words  in  the  proper  order  for  input 
parameters . 

OUTFIX  -  Arranges  common  storage  words  in  the  proper  order  for  output 
parameters . 

SEOS  -  sets  up  pointers  for  EIS  and  NEIS. 

FXAF  -  sets  up  LSTA  -  D  words  to  point  to  HIKEY. 

FBUF  -  computes  length  of  available  buffer  and  table  space. 

WDCON  -  converts  A8  code  formats  into  proper  MERGE  format;  used  for 
filler  and  terminator  bits. 

OTHER  -  contains  the  number  of  input  files  not  presently  on  a  specific 
channel . 

INITL  -  This  program  extracts  the  parameters,  and  initializes  the  common 
storage  variables.  When  the  initialization  phase  is  over,  the 
core  storage  areas  occupied  by  the  programs  FUTF  to  INITL  are 
used  as  buffer  space.  Thus,  all  space  in  core  following  the 
MERGE  program  and  preceeding  common  storage,  is  occupied  by 
tables . 

IBIN ,  IXTC,  and  K0MP1  -  These  routines  are  described  in  the  7030  Facility 

Ma  nua 1 . 
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List  of  Subroutines 


MERGE  subroutines  are  entered  in  the  following  order.  Since  some  of 

the  routines  are  cleared  out  of  core  after  being  used,  the  order  is 

of  utmost  importance. 

MAIN 

INEDIT 

OTEDIT 

MERGE 

FUTF 

IODF 

OTHER 

ZEROA 

FIXDT 

SEOS 

FXAF 

FBUF 

WDCON 

INFIX 

OUTFIX 

INITL 

IBIN 

IXTC 

K0MP1 
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SECTION  IV 


COMMON  STORAGE 


Tables 


The  following  is  a  description  of  the  tables  to  be  found  in  common 
storage . 

IODO  -  references  the  input/output  device 

FUT1  -  FUT5  -  associate  logical  input/output  with  current  and 
alternate  tape  drives 

BLIM1  -  BLIM5  -  identifies  address  of  last  item  in  the  buffer 

ICT1  -  ICT4  -  perform  housekeeping  for  the  input  channel  tapes* 
each  table  contains  seven  words  whose  functions  are 


WORD  BIT  FUNCTION 


0 

0 

1 


2 

3 


4 

3 

6 

7 


0  channel  busy  bit  indicator 

32-49  number  of  input  files  not 

on  this  channel  (OTHER) 

0-63  current  control  word  (CW) 

performing  I/O;  bit  27  used 

to  indicate  a  read  or  write 

spare 

0-17  points  to  channel’s  I/O  queue 

28-45  number  of  entries  in  the  queue 

(QP) 

0-23  bit  address  of  Last  File  A-D 

0-23  item  -  used  for  preselection 

0-23  (LSTA  -  LSTD) 

0-23 


The  rest  of  the  tables  are  used  to  accumulate  statistics  of  certain 


timing  procedures. 

QP1  -  QP5  -  pointers  to  buffer  queues 
NEIS  -  Input  buffer  table  (see  EIS) 
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NEOS  -  Output  buffer  table  (see  EOS) 

SCHR  -  bits  0-23  -  start  relative  bit  for  duplicate  comparison 
bits  32-55  -  length  to  compare  (64  reg.) 

NCHR  -  64  words  used  as  intermediate  storage 
IEXP  -  6  spare  words 

SKEY  -  bits  0-23  relative  bit  position  of  start  key 
bits  32-49  length  of  key  in  bits  -  64  words 
Words  In  Common  Storage 


WORD  NAME 

FUNCTION 

BIT  POSITION 

EIS 

Input  buffer  stack  pointer  (NEIS) 
starting  address  of  NEIS  table 
number  of  enteries  in  NEIS 

0-17 

28-45 

EOS 

Output  buffer  stack  pointer  (NEOS) 
starting  address  of  NEOS  table 
number  of  enteries  in  NEOS 

0-17 

28-45 

L 

Input  buffer  length 

0-17 

M 

Output  buffer  length 

0-17 

IL 

Input  item  length  (-sign) 

0-23 

ILLA 

Input  item  length  (+sign) 

0-23 

OUTIND 

Output  Edit  Indicator 

0 

JSKPLB 

First  record  skip  indicator 

0 

NITEDT 

Input  Edit  Indicator 

0 

JPTPLB 

Label  for  output  tape  indicator 

0 

LABEL 

The  BCD  label 

0-63 

HIKEY 

A  special  stopper  key  of  all  l's 

12-59 

NWDSK 

Number  of  key  words  for  the  out¬ 
put  edit  to  discard 

32-49 
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Words  In  Common  Storage  (Cont'd) 


WORD  NAME 

MNOREC 

MXOREC 

IOVC 

IOVBS 

OVNB 

OVTL 

OVTB 

OVFL 

BUFED 

NCOMP 

LMA 

BUFIN 

MINREC 

MAXREC 

INBS 

INVC 


FUNCTION 


Minimum  number  of  bits  for 
output  items 

Maximum  number  of  bits  for 
output  items 

Item  type 

Output  item  byte  size 

Number  of  length  control  bytes 
number  of  bytes  in  fill 
configuration 

Length  bytes  contents  are  binary 

or  decimal  indicator 

first  length  bytes'  relative 

Termination  bytes  (right 
justified) 

Fill  bytes  (right  justified) 

Starting  address  of  last  item 
processed 

Number  of  sort  keys  for  elim¬ 
ination  of  duplicate  items 

Bit  size  of  output  record 

Address  of  temporary  storage  to 
be  used  at  EOP  time  for  pro¬ 
cessing  input  records 

Minimum  input  record  size 

Maximum  input  record  size 

Input  item  byte  size 

Item  type 


BIT  POSITION 
32-55 

32-55 

32-49 

32-49 

0-17 

32-49 

0-17 

32-55 

0-63 

0-63 

0-17 

32-49 

0-23 

0-17 

0-23 

0-23 

0-17 

0-17 
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WORD  NAME 

FUNCTION 

BIT  POSITION 

IVNB 

Number  of  bytes  in  length 
control  (VT  or  VL) 

0-17 

INCT 

Mode  of  data  binary  or  decimal 
origin  of  length  control  (VL) 

0-17 

32-55 

IVTB 

Terminator  byte  right  justified 
(VT) 

0-63 

ISTD 

Merging  forward  or  backward 

0 

ICOLL 

Collate  indicator 

0 

NKEY 

Number  of  input  sort  keys 

0-17 

IOVFC 

Fill  to  maximum  length  indicator 

32-49 
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SECTION  V 


CONTROL  DECK  ARRANGEMENT 

There  are  two  sources  of  merge  parameters  in  a  MERGE  Job  Deck: 

1  The  FIOD  deck  -  The  MCP  REEL  cards  behind  each  input  tape 
IOD  card.  For  each  physical  input  unit  these  REEL  cards 
specify  the  number  of  reels  to  be  merged,  the  reel  labels, 
and  the  loading  order  of  the  reel  The  MERGEP1  card  relates 
each  file  to  the  proper  physical  unit(s).  The  job  is  usually 
run  as  a  COMPILGO  since  the  input  arrangement  may  be  varied 
2,  Two  or  more  parameter  cards  that  follow  the  "SUBTYPE  DATA" 

card  in  the  System  Card  Reader.  Each  parameter  card  contains 
either  MERGEP1,  MERGEP2,  or  MERGEP4  in  columns  1-7.  Columns 
73-80  of  each  card  are  ignored  by  the  merge  program  and  may  be 
used  in  the  usual  way  for  external  identification.  Columns 
9-72  contain,  from  left  to  right,  1-16  parameter  fields,  each 
0-8  characters  long,  separated  by  commas.  For  each  field, 
leading  and  trailing  blanks  are  ignored.  Null  and  blank  fields 
are  equivalent 
MERGEP1  Card 

This  card  defines  essential  input  parameters,  indicated  in  the  following 
list 
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FIELD  # 


CONTENTS 


LEGAL  VALUES 


VALUE 

ASSUMED  IF  NULL 


1 


2 


3 


4 


Maximum  input  record 
length  (bytes) 


Input  data  byte  size 


Should  the  merge 
skip  the  1st  record 
on  each  input  tape? 

Merge  item  length 
in  words . 


Any  item,  n,  such 
that  n*byte  sice 
_>192  bits 

1  to  64 
W  =  64 
B  =  1 
C8  =  8 
C,C6  =  6 

Blank  =  No 
Nonblank  =  Yes 


Integer 


5  Is  an  input  Edit 
required? 

6  Number  of  logical 
inputs  (M)  to  be 
merged 

7,  9,  etc  For  each  logical  in-  Integer 

put,  the  IOD  (refer¬ 
ence  number  can  be 
obtained  by  examining 
position  in  FIOD  deck) 
of  tape  drive  on  which 
odd-numbered  reels  of 
this  input  (1st,  3rd, 
etc.)  are  to  be  mounted. 

8,10,12,  etc.  IOD  of  the  tape  drive  Integer 
on  which  even  num¬ 
bered  reels  (2nd,  4th 
etc.)  of  this 
input  are  mounted. 


Blank  =  No 
Nonblank  =  Yes 

2  <M  <4 


May  not  be  null. 
Not  applicable 
to  FORTRAN . 

6 


No . 


Meaningless  if  an 
input  edit  is 
specified.  In 
that  case  this 
value  is  com¬ 
puted  . 

No. 


May  not  be  null. 


Not  a  1 lowed . 


Not  allowed. 
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MERGEP2  Cards 


These  cards  have  two  different  formats  which  define  parameters  for 
the  standard  edit  routine  -  they  are  not  needed  if  input  editing  is 


not  desired .  These  cards  are  the  same  as  the  S0RTP2  parameters. 


FIELD  #  CONTENTS  LEGAL  VALUE 

F.FBIN  VFBIN  VL  ,VT 


VALUE  ASSUMED 
IF  NULL 


1 

2 


3 


1 

2 


1 

2 


3 


Desired  output 
Order 

Should  key  bytes 
be  converted  by 
table  lookup 
before  sorting: 

Type  of  item 


A  =  ascending 
D  =  descending 

S  =  Yes 
N  =  No 


Ascending 


No  conversion 


F  =  fixed  Fixed  length 

length 

VL  =  variable 
length , 
speci fied 
in  a  field  of 
each  item 

VT  =  variable  length, 
marked  by 
termination  bytes 
FBIN  =  fixed  FORTRAN 
B  or  U  records 
VFB'IN  =  variable 

length  B  records 


4  4 


5 


Fixed  or 

1  to  maximum  input 

6  words 

minimum  input 
items  length 
(bytes) 

record  length 

Maximum  input 

1  to  maximum 

6  words 

item  length 

input  record 

(bytes) 

length 

Type  VL 

6  Mode  of  field  B  =  binary 
containing  length  integer 

D  =  decimal 

1  digit/byte 


Binary 
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FIELD  # 

F  ,FBIN  VFBIN 

VL  ,VT 

CONTENTS 

LEGAL  VALUE 

VALUE  ASSUMED 
IF  NULL 

7 

Relative  origin 
of  1st  length  - 
containing  byte 
(in  bytes) 

1  to  minimum  item 
length 

Not 

applicable 

8 

Type 

VT 

Length  in  bytes 
of  field 

1  to  minimum 
item  length 

Not 

app licab le 

b 

Entry  mode  of 

terminator 

configuration 

A  =  A8  or  A6 

0  -  Octal ; 
last  three  bits 
of  each  input  byte. 

A 

7 

Terminator 

configuration 

Any  value  legal 
for  mode 

Not 

applicab le 

8 

Length  in  bytes 
of  terminator 

1  to  maximum  item 
byte 

Not 

applicable 

5  6  9 

Remaining  MERGEP2  Cards 

Number  of  key 
fields 

1  to  64 

1 

FIELD  # 

CONTENTS 

LEGAL  VALUE 

VALUE  ASSUMED 
IF  NULL 

1 

Relative  origin 
of  1st  key  field 
(byte) 

1  to  minimum 
item  length 

Not 

applicable 

2 

Length  of  1st 
key  field 
(byte) 

1  to  minimum 
item  length 

ii 

3- A 

Second  key 
speci fication 
particular  s 

n 

1 1 

etc  . 

etc . 

etc . 

ti 
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Additional  keys  may  be  input  on  MERGEP2  Cards  as  necessary  to  contain 
pairs  of  key  specifications  parameters. 

MERGEP3  Cards 


Essential  output  parameters  are  defined  by  MERGE3  cards. 


FIELD  # 

CONTENTS 

LEGAL  VALUE 

VALUE  ASSUMED 
IF  NULL 

1 

Maximum  output 
record  length 

Any  item,  n, 
such  that  n*byte 
size  ^  192  bits. 

May  noc  br  ruj  1 
Not  applicable 
to  FORTRAN 

2 

Output  data 
byte 

1  to  64 

W  =  64 

B  =  1 

C8  =  8 

C6,C  =  6 

6 

3 

Should  the  merge 
create  a  dummy 
label  record  on 
each  output  tape? 

Blank  =  No 

Nonblank  =  Yes 

(up  to  8  characters) 

No 

4 

Is  an  output 
edit  required? 

Blank  =  No 

Nonblank  =  Yes 

No 

5 

IOD  of  the  tape 
drive  on  which 
the  odd-numbered 
output  reels  (1st 

3rd,  etc)  are  to 
be  written. 

Integer 

Not  allowed. 

6 

IOD  of  the  tape 
drive  on  which  the 
even  numbered  out¬ 
put  reels  (2nd,  4th 
etc.)  should  be 
written  if  different 
from  3  above 

Integer 

Not  allowd 

MERGEP4  Cards 

This  MERGEP4  card  group  defines  parameters  for  the  standard  output  edit 
routine;  it  is  not  needed  if  output  editing  is  not  desired 
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FIELD  # 

F ,FBIN  VFBIN 

VL, 

VT 

CONTENTS 

LEGAL  VALUES 

VALUE  ASSUMED 
IF  NULL 

i  i 

l 

Type  of  item 

F=fixed  length 
VL=variable 
length  specified 
in  a  field  of 
each  item. 

Fixed  length 

VT=variable 
length  marked 
by  terminator 
byte (s) 

FBIN-f ixed  FORTRAN 

B  or  U  records 
VFBIN=variab le 
length  B  records 

2  2 

2 

Key  length 

Integers:  ignored 

if  input  edit 
is  specified. 

Not 

appli cable 

3  3 

3 

Fixed  or 
minimum 

1  to  maximum 
output  record 
length 

6  words 

4 

4 

Maximum  output 
item  length 
(bytes) . 

1  to  maximum  out¬ 
put  record  length 

6  words 

Type 

VL 

5 

Mode  of  field 
containing  length 

B  =  binary  integer 

Binary 

6 

Relative  origin 
of  1st  length 
containing  byte 

1  to  minimum 
item  length 

Not 

applicable 

7 

Length  of  field 
containing  length 
(in  bytes) 

1  to  minimum 
item  length 

Not 

app  I  icab le 

Type 

VT 

5 

Entry  mode  of 
terminator 
conf igurat ion 

A=A6,  A8 

0=0ctal 

A6  or  A8 
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F ,FBIN 


4 


Second 


FIELD  # 

VFBIN  VL ,VT 

6 

7 

Both 
VT  and  VL 


CONTENTS 


Terminator 

configuration 

Length  of  bytes 
of  terminator 


LEGAL  VALUES 


Any  value 
legal  for  mode 

1  to  maximum 
item  length 


5  8  Mode  of  fill  A=A6 ,  A8 

configuration  0=0ctal 
of  item  is  to 
maximum  length. 


6 


7 


Fill  configuration 


Any  value 
legal  for  mode. 


10  Length  in  bytes 
of  fill  con¬ 
figuration 


1  to  maximum 
size 


8  11  Number  of  fields  1-64 

to  be  compared  for 
duplicate  elimin¬ 
ation 


Type  MERGEP4  Card 


1 


2 

3,4 


etc . 


Relative  origin  1  to  minimum 

of  first  key  item  length 

field  (1  byte) 


Length  of  1st  1  to  minimum 

key  field  (bytes)  length  origin 


Second  key  speci¬ 
fications  para¬ 
meter 


etc . 


etc . 


VALUE  ASSUMED 
IF  NULL 


Not 

applicable 

Not 

appl icabl e 


No  fill 


Not 

applicable 


Not 

applicable 
do  not  check 
for  duplicate 
removal 

it 


Not 

applicable 


Not 

applicable 


etc . 
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Subsequent  MERGEP4  cards  may  be  used  as  necessary  to  contain  pairs 


of  key  specifications  that  will  not  fit  on  the  current  card, 
last  card  of  the  deck  is  "END"  in  columns  1-3. 


The 


-31- 


D:C 


APPENDIX 


Al 


-33- 


FOUR-WAY  COMPARISON  AND  RETURN  TREE 


Security  Classification 


DOCUMENT  CONTROL  DATA  -  R  &  D 


(SfCurifi*  <  Ins  situ  ation  of  body  ol  nbstrut  t  mid  indexing  annotation  must  he  entered  when  the  overall  report  is  c Inxsi lietlj 


1  originating  activity  (Corporate  author) 

The  MITRE  Corporation 

Bedford,  Massachusetts 

U«.  REPORT  SECURITY  C  L  A  SSI  f  1  C  a  I  1 0  N 

Unclassified 

2b  GROUP 

3  REPORT  TITLE: 

MERGE 

4  fFSCRIP  Ti  VE  NOTES  (T^’pe  ot  report  and  jnc/udive  date  s ) 

N/A 

6  auThORiS  )(  First  name,  middle  initial,  Ja  a  t  name) 
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0 
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IT  ABSTRACT 


A  general  purpose  7030  computer  program  has  been  written  in  the  FORTRAN 
environment  to  merge  in  a  single  run  of  the  program  two,  three,  or  four  tape  files 
which  are  the  resulting  output  from  a  sort  program.  These  files  may  be  more  than 
one  tape  in  length  and  may  be  either  FORTRAN  or  non-FORTRAN  formatted.  Input 
and  output  processors  have  been  designed  to  handle  various  types  of  item  formats; 
the  program  accepts  items  of  fixed  length,  variable  length  -  identified  by  a  pre¬ 
designed  terminator  -  and  variable  length  with  a  length  defining  field.  Upon  request 
the  output  processor  will  eliminate  duplicate  items  and/or  fill  in  variable  length 
records  to  a  standard  size. 
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